NANYANG TECHNOLOGICAL UNIVERSITY SCHOOL OF HUMANITIES AND SOCIAL SCIENCES Creating derivational morphology links in Wordnet Bahasa
نویسنده
چکیده
Derivational morphology links are created for the Wordnet Bahasa, a combined Indonesian and Malay online lexical dictionary (Nurril Hirfana, Suerya, & Bond, 2011). The focus was to link root words to affixed words as affixation is one of the more apparent word formation processes in Bahasa Melayu. MorphInd, an Indonesian morphological analyser (Larasati, Kubon, & Zeman, 2011), is used to breakdown affixed words into their root form and affixes. Using Python 2.7 with NLTK, a raw mapping is done by matching the analysed words to the root forms. The derivational links in the Princeton Wordnet (PWN) are used to verify if the same links exist in Wordnet Bahasa. Redundant links are removed by the Part-of-Speech (POS) filter and Semantic Super Type filter. The links are then disambiguated using the Lesk algorithm, where the definitions and other components of the sense (e.g. hypernyms, hyponyms and examples) are compared for their similarity. However, the disambiguation process is rendered ineffective because of the high amount of errors still existing in Wordnet Bahasa. The derivational links are released as a separate file and only those with similar derivational links to PWN are added into Wordnet Bahasa. Erroneous entries that were identified using MorphInd are removed from Wordnet Bahasa.
منابع مشابه
A hybrid refinement scheme for intra- and cross-corpora phonetic segmentation
A hybrid refinement scheme for intraand crosscorpora phonetic segmentation Sixuan Zhao a,∗, Ing Yann Soon a, Soo Ngee Koh a, Kang Kwong Luke b a School of Electrical & Electronic Engineering, Nanyang Technological University, 50, Nanyang Avenue, Singapore 639798, Singapore b School of Humanities & Social Sciences, Nanyang Technological University, 50, Nanyang Avenue, Singapore 639798, Singapore
متن کاملiNTErSuBJECTivE CONSENSuS aNd ThE maiNTENaNCE Of NOrmaTivE SharEd rEaliTy
422 Ching Wan, Division of Psychology, Nanyang Technological University; Carlos J. Torelli, Carlson School of Management, University of Minnesota; Chi-yue Chiu, Nanyang Business School, Nanyang Technological University. The present research was partly supported by a grant (BCS-0743119) the National Science Foundation awarded to the third author. Correspondence concerning this article should be ...
متن کاملPredicting Depressive Symptoms a ProsPeCtive eXamination oF dePressive symPtomoloGy: understandinG tHe relationsHiP betWeen neGative events, selF-esteem, and neurotiCism
438 The research reported in this article was supported by a McGill University Social Sciences and Humanities Student Research Grant Awarded to Randy P. Auerbach. Randy P. Auerbach, Harvard Medical School – McLean Hospital, Department of Psychiatry; John R. Z. Abela, Department of Psychology, Rutgers University; Moon-Ho Ringo Ho, Division of Psychology, School of Humanities and Social Sciences,...
متن کاملLanguage Lateralization Explained by the Generalized Fractional Anisotropy in the Auditory Nerve and the Corpus Collosum as Studied Using Diffusion Spectrum Imaging Tractography and Fmri
K. Matsuo, Y-C. Lo, F-C. Yeh, Y-H. Wu, S-H. A. Chen, and W-Y. I. Tseng Center for Optoelectronic Biomedicine, National Taiwan University College of Medicine, Taipei, Taiwan, Institute of Biomedical Engineering, National Taiwan University, Taipei, Taiwan, Department of Biomedical Engineering, Carnegie Mellon University, Pittsburgh, PA, United States, Department of Medicine, National Taiwan Unive...
متن کاملImproved variance estimation of maximum likelihood estimators in stable first-order dynamic regression models
In dynamic regression models conditional maximum likelihood (least-squares) coeffi cient and variance estimators are biased. From expansions of the coeffi cient variance and its estimator we obtain an approximation to the bias in variance estimation and a bias corrected variance estimator, for both the standard and a bias corrected coeffi cient estimator. These enable a comparison of their mean...
متن کامل